WHAT IS CLAIMED IS: 



1. 



A processing core that executes a parallel multiply accumulate 



2 



operation, the processing core comprising: 



3 



a first input operand register comprising a plurality of first input operands; 
a second input operand register comprising a plurality of second input 



4 



5 



operands; 



6 



a third input operand register comprising a plurality of third input 



7 operands; 

8 a plurality of functional blocks that each perform a multiply accumulate 

9 operation; 

10 an output operand register comprising a plurality of output operands, 

1 1 wherein each of the plurality of output operands is related to one of the plurality of first 

12 input operands, one of the plurality of second input operands and one of the plurality of 

13 third input operands. 

1 2. The processing core as set forth in claim 1, wherein each of the 

2 plurality of functional blocks produce a result equal to multiplying the first input operand 

3 with the second input operand to produce a product and adding the third input operand to 

4 that product. 

1 3. The processing core as set forth in claim 2, wherein the product is 

2 truncated before adding the product to the third input operand. 

1 4. The processing core as set forth in claim 2, wherein the product is 

2 rounded before adding the product to the third input operand. 

1 5. The processing core as set forth in claim 1, wherein a very long 

2 instruction word includes a plurality of parallel multiply accumulate operations. 

1 6. The processing core as set forth in claim 1, wherein the first 

2 through third input operands are at least one of following formats: integer, floating-point, 

3 fixed-point, and two's complement. 
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1 7. The processing core as set forth in claim 1, wherein the first 

2 through third input operands are at least one of: 128 bit values, 64 bit values, 32 bit 

3 values, 16 bit values, and 8 bit values. 

1 8. A method for determining a plurality of output operands, the 

2 method comprising: 

3 loading a multiply accumulate instruction; 

4 decoding the multiply accumulate instruction; 

5 loading a plurality of multiplicands, a plurality of multipliers and a 

6 plurality of accumulate values; 

7 processing a first multiplicand, a first multiplier and a first accumulated 

8 value in order to produce a first output operand; 

9 processing a second multiplicand, a second multiplier and a second 

1 0 accumulated value in order to produce a second output operand; and 

1 1 storing the first output operand and second output operand. 

1 9. The method of claim 8, further comprising: 

2 loading a second multiply accumulate instruction; 

3 decoding the second multiply accumulate instruction; 



4 loading a second plurality of multiplicands, a second plurality of 

5 multipliers and a second plurality of accumulate values; 

6 processing a third multiplicand, a third multiplier and a third accumulated 

7 value in order to produce a third output operand; 

8 processing a fourth multiplicand, a fourth multiplier and a fourth 



9 accumulated value in order to produce a fourth output operand; 

1 0 storing the third output operand and fourth output operand; and 

1 1 concatenating the two of the first through fourth output operands together 

12 to form a larger output operand. 

1 1 0. The method of claim 8, wherein the multiply accumulate 



2 instruction and the second multiply accumulate instruction are included in the same very 

3 long instruction word (VLIW). 

1 11. The method of claim 8, further comprising: 
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2 determining if the first output operand is larger than a predetermined 

3 value; and 

4 replacing the first output operand with a predetermined constant. 

1 12. The method of claim 1 1 , wherein the predetermined value is equal 

2 to a largest or smallest value for the first and second output operand. 

1 1 3. A method for determining a plurality of output operands, the 

2 method comprising: 

3 loading a multiply accumulate instruction; 

4 decoding the multiply accumulate instruction; 

5 loading a plurality of multiplicands, a plurality of multipliers and a 

6 plurality of accumulate values; 

7 determining a first result equal to a product of a first multiplicand and a 

8 first multiplier plus a first accumulate value; and 

9 storing the first result. 

1 14. The method of claim 13, further comprising: 

2 loading a second multiply accumulate instruction; 

3 decoding the second multiply accumulate instruction; 

4 loading a second plurality of multiplicands, a second plurality of 

5 multipliers and a second plurality of accumulate values; 

6 determining a second result equal to a product of a second multiplicand 

7 and a second multiplier plus a second accumulate value; 

8 storing the second result; and 

9 arranging the first result and second result into an unified result occupying 
1 0 more bits than either the first or second results. 

1 15. The method of claim 13, wherein the multiply accumulate 

2 instruction and the second multiply accumulate instruction are included in the same very 

3 long instruction word (VLIW). 

1 16. The method of claim 13, further comprising: 

2 determining a second result equal to a product of a second multiplicand 

3 and a second multiplier plus a second accumulate value; and 

4 storing the second result. 
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1 1 7. The method of claim 13, further comprising: 

2 determining if the first result is larger than a predetermined value; and 

3 replacing the first result with a predetermined constant. 

1 18. The method of claim 1 7 , wherein the predetermined value is equal 

2 to a largest or smallest value for the first and second output operand. 
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